11 research outputs found
Power-law distributions in binned empirical data
Many man-made and natural phenomena, including the intensity of earthquakes,
population of cities and size of international wars, are believed to follow
power-law distributions. The accurate identification of power-law patterns has
significant consequences for correctly understanding and modeling complex
systems. However, statistical evidence for or against the power-law hypothesis
is complicated by large fluctuations in the empirical distribution's tail, and
these are worsened when information is lost from binning the data. We adapt the
statistically principled framework for testing the power-law hypothesis,
developed by Clauset, Shalizi and Newman, to the case of binned data. This
approach includes maximum-likelihood fitting, a hypothesis test based on the
Kolmogorov--Smirnov goodness-of-fit statistic and likelihood ratio tests for
comparing against alternative explanations. We evaluate the effectiveness of
these methods on synthetic binned data with known structure, quantify the loss
of statistical power due to binning, and apply the methods to twelve real-world
binned data sets with heavy-tailed patterns.Comment: Published in at http://dx.doi.org/10.1214/13-AOAS710 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Recommended from our members
Effects of Multilayer Network Interactions on Neural Network Dynamics
Networks of excitable units are found in varied disciplines such as social science, neuroscience, genetics, epidemiology, etc. Previous studies have shown that some aspects of network function can be optimized when the network operates in the 'critical regime', i.e., at the boundary between order and disorder where the statistics of node excitations correspond to those of a classical branching process. In this thesis, we introduce and study a mathematical model of a neural network with the goal of understanding the long-standing problem of determining the mechanisms by which a neural network regulates its activity so as to operate in the critical regime. In particular, we study the dynamics of a two-layered network model consisting of an excitable node network and a complementary network that supplies resources required for node firing. More specifically, we study the dynamics of an excitable neural network consisting of neurons (nodes) connected via synapses (edges). Synaptic strengths are mediated by resources supplied by the complementary glial cell network. Resources from the bloodstream are supplied to the glial network at some fixed rate, resources transport diffusively within the glial cell network and ultimately to the synapses, and each time a presynaptic neuron fires the resources for all outgoing synapses get consumed at some fixed rate. We show that this natural and very compelling mechanism for feedback control can stabilize the critical state. Additionally, the neural network can learn, remember and recover the critical state after learning. The critical state is characterized by power-law distributed avalanche sizes that are robust to changes in the supply, consumption and diffusion rates. Finally, we show that our findings are fairly robust to heterogeneity in model parameters or network structure
Findings of the IWSLT 2022 Evaluation Campaign.
The evaluation campaign of the 19th International Conference on Spoken Language Translation featured eight shared tasks: (i) Simultaneous speech translation, (ii) Offline speech translation, (iii) Speech to speech translation, (iv) Low-resource speech translation, (v) Multilingual speech translation, (vi) Dialect speech translation, (vii) Formality control for speech translation, (viii) Isometric speech translation. A total of 27 teams participated in at least one of the shared tasks. This paper details, for each shared task, the purpose of the task, the data that were released, the evaluation metrics that were applied, the submissions that were received and the results that were achieved
Dubbing in Practice: A Large Scale Study of Human Localization With Insights for Automatic Dubbing
AbstractWe investigate how humans perform the task of dubbing video content from one language into another, leveraging a novel corpus of 319.57 hours of video from 54 professionally produced titles. This is the first such large-scale study we are aware of. The results challenge a number of assumptions commonly made in both qualitative literature on human dubbing and machine-learning literature on automatic dubbing, arguing for the importance of vocal naturalness and translation quality over commonly emphasized isometric (character length) and lip-sync constraints, and for a more qualified view of the importance of isochronic (timing) constraints. We also find substantial influence of the source-side audio on human dubs through channels other than the words of the translation, pointing to the need for research on ways to preserve speech characteristics, as well as transfer of semantic properties such as emphasis and emotion, in automatic dubbing systems
FINDINGS OF THE IWSLT 2022 EVALUATION CAMPAIGN
The evaluation campaign of the 19th International Conference on Spoken Language Translation featured eight shared tasks: (i) Simultaneous speech translation, (ii) Offline speech translation, (iii) Speech to speech translation, (iv) Low-resource speech translation, (v) Multilingual speech translation, (vi) Dialect speech translation, (vii) Formality control for speech translation, (viii) Isometric speech translation. A total of 27 teams participated in at least one of the shared tasks. This paper details, for each shared task, the purpose of the task, the data that were released, the evaluation metrics that were applied, the submissions that were received and the results that were achieved